Why is the Legendre Transformation Involutive? 



Harald Skarke 

Institut fiir Theoretische Physik, Technische Universitat Wien 
"(■Sf. Wiedner Hauptstrafie 8-10, A-1040 Wien, AUSTRIA 

skarke@hep . itp . tuwien. ac . at 

(N ' 

<D 
00 



(N 



ABSTRACT 

The question posed in the title is answered in terms of a simple pictorial argument that is 
manifestly symmetric between the two functions that are Legendre transform of each other. 

S: 

The Legendre transformation is a mathematical concept of great significance to physics. 
In mechanics and field theory it provides the transition between Hamiltonian and Lagrangian 
descriptions, and in thermodynamics it relates the different potentials. Nevertheless, with 
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very few exceptions (notably [1]), it is usually introduced just along the way, leaving the 
impression of a sleight of hand. The feeling that some essential point might be missing from the 
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standard description provided the motivation for the present considerations. In the following 
the definition of the Legendre transform G(y) of a function F(x) and a simple argument for its 
involutivity will be given. 

Let us assume that the function F(x) is continuously differentiable, with a derivative 

fix) := F'{x) 

that is strictly monotonically increasing. Then the function f(x) can be inverted to g(y): 

y = f(x) x = g(y), 

and the Legendre transform of F(x) is defined as 

G(y):=[xy-F(x)]\ x=9{y) . (1) 
If one performs the same operation on G(y): 

z:=G'(y), H(z):=[yz-G(y)]\ y=h{z)l 
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where h is the function inverse to G', a very short calculation reveals that z = x, h = f and 
H = F, i.e. one has returned to the original function. This is, of course, perfectly sufficient as a 
proof of involutivity, but a physicist would prefer a more intuitive explanation, ideally in terms 
of geometry. The standard geometric interpretation of the Legendre transform proceeds by 
considering the graph of the convex function F(x) and its tangents. This is a correct pictorial 
account of formula (pQ) which can be used to give a geometric proof (see, e.g., Arnold [2]), but 
it does not make the symmetry between F, f and x on one side and G, g and y on the other 
side manifest. Let us therefore look at the graph of the monotonic function f(x) instead. 



Figure 1: The graph of y — f(x) for the cases (a) x > 0, y > and (b) x > 0, y < 

We first assume that x and f(x) are positive; see the first diagram of Figure 1 which depicts 
the graph of f(x) over x. The same plot can be interpreted as the graph of g(y) with respect to 
the y-axis. Expressed in a symmetric manner, the figure shows the locus of all pairs (x, y) with 
y = f(x) or, equivalently, x = g(y). Now consider the rectangle bounded by the coordinate 
axes and their parallels through such a point (x,y). The area of that rectangle is A = xy, 
and the graph cuts it into two parts with areas F and G, respectively. From the figure it is 
immediately obvious that 



with xq = if the graph intersects the y-axis in yo > 0, and yo = if the graph intersects the 
x-axis in xq > 0. Clearly F is a function of x with F'(x) = f(x) = F'(x), hence 




y=f(x) 








F + G = xy 



F(x) = F(x) + c, G(y) = G(y) - c 
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for some real constant c. So F is, up to a constant, the area under the graph of /, and G is, 
up to minus that constant, the area under the graph of g, and the symmetry is manifest. 



What if our assumptions x > 0, y > are not satisfied? For x < 0, y < the argument 
is essentially unmodified since (—x)(—y) = xy. If xy < consider the second part of Figure 
1. Here we have fixed two arbitrary constant values xq, yo in such a way that xq > x > 0, 
yo < y < for the range of pairs (x, y) that we want to consider. Denote by A the area 
determined by the coordinate axes, the vertical line through xq, the horizontal line through y Q 
and the graph. Then we have 



Up to the constant Aq that can be absorbed in the redefinitions from F to F and from G to 
G, F and G again add up to xy. 

The fact that the present picture requires redefinitions of functions by constants is directly 
related to the interpretation of F(x) and G(y) as integrals of f(x) and g(y), respectively. As 
always, integrals are well-defined only up to equivalences of the type F ~ F, with '~' meaning 
'equal up to a constant function'. This fits precisely with the physical interpretation, where 
the predictions do not change if quantities like the Hamiltonian or thermodynamic potentials 
are redefined by constants. 

Note added after completion: I first presented this material in informal talks on March 
15, 2012 in Vienna and on June 4, 2012 in Heidelberg. After writing it up in the form of the 
present manuscript, I became aware of [3] which is dated June 29, 2012 (submission) / August 
22, 2012 (publication) and has some overlap in content. I am grateful to Johanna Knapp for 
pointing out this reference to me. 
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